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Abstract — Spectrum sensing, which aims at detecting spec- 
trum holes, is the precondition for the implementation of 
cognitive radio (CR). Collaborative spectrum sensing among 
the cognitive radio nodes is expected to improve the ability 
of checking complete spectrum usage. Due to hardware limi- 
tations, each cognitive radio node can only sense a relatively 
narrow band of radio spectrum. Consequently, the available 
channel sensing information is far from being sufficient for 
precisely recognizing the wide range of unoccupied channels. 
Aiming at breaking this bottleneck, we propose to apply 
matrix completion and joint sparsity recovery to reduce sensing 
and transmitting requirements and improve sensing results. 
Specifically, equipped with a frequency selective filter, each 
cognitive radio node senses linear combinations of multiple 
channel information and reports them to the fusion center, 
where occupied channels are then decoded from the reports 
by using novel matrix completion and joint sparsity recovery 
algorithms. As a result, the number of reports sent from the 
CRs to the fusion center is significantly reduced. We propose 
two decoding approaches, one based on matrix completion and 
the other based on joint sparsity recovery, both of which allow 
exact recovery from incomplete reports. The numerical results 
validate the effectiveness and robustness of our approaches. 
In particular, in small-scale networks, the matrix completion 
approach achieves exact channel detection with a number of 
samples no more than 50% of the number of channels in 
the network, while joint sparsity recovery achieves similar 
performance in large-scale networks. 

Keywords: Collaborative spectrum sensing, matrix comple- 
tion, compressive sensing, joint sparsity recovery. 



I. Introduction 

Ever since the 1920s, every wireless system has been 
required to have an exclusive license from the government in 
order not to interfere with other users of the radio spectrum. 
Today, with the emergence of new technologies which enable 
new wireless services, virtually all usable radio frequencies 
are already licensed to commercial operators and government 
entities. According to former U.S. Federal Communications 
Commission (FCC) chair William Kennard, we are facing 
with a "spectrum drought" JTJ . On the other hand, not every 
channel in every band is in use all the time; even for premium 
frequencies below 3 GHz in dense, revenue-rich urban areas, 
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most bands are quiet most of the time. The FCC in the United 
States and the Ofcom in the United Kingdom, as well as 
regulatory bodies in other countries, have found that most 
of the precious, licensed radio frequency spectrum resources 
are inefficiently utilized 0, 0. 

In order to increase the efficiency of spectrum utilization, 
diverse types of technologies have been deployed. Cognitive 
radio is one of those that leads to the greatest technological 
gain in wireless capacity. Through the detection and utiliza- 
tion of the spectra that are assigned to the licensed users 
but standing idle at certain times, cognitive radio acts as a 
key enabler for spectrum sharing. Spectrum sensing, aiming 
at detecting spectrum holes (i.e., channels not used by any 
primary users), is the precondition for the implementation 
of cognitive radio. The Cognitive Radio (CR) nodes must 
constantly sense the spectrum in order to detect the presence 
of the Primary Radio (PR) nodes and use the spectrum holes 
without causing harmful interference to the PRs. Hence, 
sensing the spectrum in a reliable manner is of vital im- 
portance and constitutes a major challenge in CR networks. 
However, detection is compromised when a user experiences 
shadowing or fading effects or fails in an unknown way. 
To get a better understanding of the problem, consider the 
following example: a typical Digital TV receiver operating 
in a 6 MHz band must be able to decode a signal level of 
at least -83 dBm without significant errors J4). The typical 
thermal noise in such bands is -106 dBm. Hence a CR which 
is 30 dBm more sensitive has to detect a signal level of - 
113 dBm, which is below the noise floor 0. In such cases, 
one CR user cannot distinguish between an unused band 
and a deep fade. In order to combat such effects, recent 
studies suggest collaboration among multiple CR nodes for 
improving spectrum sensing performance. 

Collaborative spectrum sensing (CSS) techniques are in- 
troduced to improve the performance of spectrum sensing. 
By allowing different secondary users to collaborate and 
share their information, PR detection probability can be 
greatly increased. CSS can be classified into two categories. 
The first category involves multiple users exchanging infor- 
mation 0, Q, and the second category uses relay trans- 
mission 0. Some recent studies on collaborative spectrum 



sensing include cooperative scheme design guided by game 
theory |5] and random matrix theory iflOll . cluster-based 
cooperative CSS ifTTl . and distributed rule-regulated CSS 
lfl2l ; studies concentrating on CSS performance improve- 
ment include lfl3l introducing spatial diversity techniques to 
combat the error probability due to fading on the reporting 
channel between the CR nodes and the central fusion center. 
There are also studies concerning other interesting aspects 
of CSS performance under different constraints iflOl . 03)- 
lfl6l . Very recently, there are emerging applications of the 
compressive sensing concept for CSS IfTTl . 

Existing literature mostly focuses on the CSS performance 
examination when the centralized fusion center receives and 
combines all CR reports. In an n channel cognitive radio 
network with m CR nodes, the fusion center has to deal with 
n * m reports and combine them wisely to form a channel 
sensing result. However,it is known that wireless channels 
are subject to fading and shadowing. When secondary users 
experience multi-path fading or happen to be shadowed, the 
reports transmitted by CR users are subject to transmission 
loss. As a result, in practice, no entire report data set is 
available at the fusion center. Besides, due to the fact that 
each cognitive radio can only sense a small proportion of 
the spectrum with limited hardware, each CR user gathers 
only very limited information about the entire spectrum. 

Contributions: 

We seek to release CRs from sending, and the central 
control unit from gathering, an excessively large number of 
reports, also target at the situations where there are only a 
few CR nodes in a large network and thus unable to gather 
enough sensing information for the traditional CSS. We 
propose to equip each cognitive radio node with a frequency 
selective filter, which linearly combines multiple channel 
information. The linear combinations are sent as reports to 
the fusion center, where the occupied channels are decoded 
from the reports by compressive sensing algorithms. As a 
consequence, the amount of channel sensing at CRs and the 
number of reports sent from the CRs to the fusion center are 
both significantly reduced. 

Following our previous work ||lin . |fl9ll on compressive 
sensing, we propose two approaches to collaborative spectral 
sensing. The first approach is based on solving a matrix 
completion problem lF20l - lF24l . which seeks to efficiently 
reconstruct a matrix (typically low-rank) from a relatively 
small number of revealed entries. In this approach, the 
entries of the underlying matrix are linear combinations of 
channel powers. Each CR node takes its local spectrum 
measurements, but instead of directly recording channel 
powers, it uses its frequency-selective filters to take p linear 
combinations of channel powers and reports them to the 
fusion center. The total p x m linear combinations taken by 
m CRs form apxm matrix at the fusion center. Considering 
transmission loss, we allow the the matrix to be incomplete. 
We show that this matrix is low-rank and has the properties 



enabling its reconstruction from only a small number of 
its entries, and therefore, information about the complete 
spectrum usage can be recovered from a small number 
of reports from the CR nodes. This approach significantly 
reduces the amount of sensing and communication workload. 

The second approach is based on joint sparsity recovery 
E5l - l|29l , which is motivated by the observation that the 
spectrum usage information the CR nodes collect has a 
common sparsity pattern: each of the few occupied channels 
is typically observed by multiple CRs. We develop a novel 
algorithm for joint sparsity signal recovery, which is more 
effective than existing algorithms in the compressive sensing 
literature since it can accommodate a large dynamic range 
of channel gains. 

In both approaches, every CR senses all channels (by 
taking random linear projections of the powers of all chan- 
nels), and the CRs do not communicate. While they work 
independently, their measurements are analyzed jointly by 
the detection algorithms running at the fusion center. There- 
fore, our approaches are very different from the existing 
collaborative spectrum sensing schemes in which different 
CRs are assigned to different channels. Our approaches move 
from collaborative sensing to "collaborative" computation 
and shift coordination from the sensing phase to the post- 
sensing phase. 

Our work is among the first that applies matrix completion 
or joint sparsity recovery to collaborative spectrum sensing 
in cognitive radio networks. Matrix completion and joint 
sparsity recovery are both being intensively studied in the 
compressive sensing community. We present them both 
because it is too early at this time to make a verdict of 
an eventual winner. 

The rest of this paper is organized as follows: In Section 
HH the system model is given. The matrix completion-based 
algorithm for collaborative sensing is described in Section 
Inil and the joint sparsity based algorithm is described in 
Section[lV] After that, in Section|V]we compare the two pro- 
posed approaches, discuss their computational complexity as 
well as filter design and dynamic update. Simulation results 
are presented in Section [VI] and conclusions are drawn in 

Section [yn] 

II. System Model 

We consider a cognitive radio network with m CR nodes 
that locally monitor a subset of n channels. A channel is 
either occupied by a PR or unoccupied, corresponding to 
the states 1 and 0, respectively. We assume that the number 
s of occupied channels is much smaller than n. The goal 
is to recover the occupied channels from the CR nodes' 
observations. Since each CR node can only sense limited 
spectrum at a time, it is impossible for limited m CRs to 
observe n channels simultaneously. 

To overcome this problem, we propose the scheme de- 
picted in Fig. Q] Instead of scanning all channels and 



sending each channel's status to the fusion center, using 
its frequency-selective filters, a CR takes a small number 
of measurements that are linear combinations of multiple 
channels. The filter coefficients can be designed and imple- 
mented easily. In order to mix the different channel sensing 
information, the filter coefficients are designed to be random 
numbers. Then, these filter outputs are sent to the fusion 
center. Suppose that there are p frequency selective filters 
in each CR node sending out p reports regarding the n 
channels. For the non-ideal cases, where we have relatively 
less measurements pm < n, i.e., the number of reports sent 
from all CRs is less than the total number of channels. The 
sensing process at each CR can be represented by a p x n 
filter coefficient matrix F. Let an n x n diagonal matrix R 
represent the states of all the channel sources using and 
1 as diagonal entries, indicating the unoccupied or occupied 
states, respectively. There are s nonzero entries in diag(R). 
In addition, channel gains between the CRs and channels are 
described in an to x n channel gain matrix G given by l30l : 

Gi,j = Pi(di,j)~ a ^ 2 \hij\ 



Unoccupied 



Unoccupied 



(1) 



where Pj is the i primary user's transmitted power, dij 
is the distance between the primary transmitter using j th 
channel and the i th CR node, a is the propagation loss 
factor, and hij is the channel fading gain. For AWGN 
channel, hij = l,Vi,j; for Rayleigh channel, \hi t j\ follows 
independent Rayleigh distribution; and for shadowing fading, 
\hij\ follows log-normal distribution ll30l . Without loss of 
generality, we assume that all PRs' use unit transmit power 
(otherwise, we can compensate by altering the corresponding 
channel gains). The measurement reports sent to the fusion 
center can be written as a p x m matrix 



M 



pxm 



(2) 



Note that due to loss or errors, some of the entries of M 
are possibly missing. The binary numbers on the diagonal 
of R are the n-channel states that we shall estimate from 
the available entries of M. 



III. CSS Matrix Completion Algorithm 

It is typically difficult for the fusion center to acquire all 
entries of M due to transmission failure, which means that 
our observation is a subset E C [p] x [m] of M. However, 
it is possible to recover the missing entries in M since it 
holds the following two important properties l20l required 
for matrix completion: 

1) Low Rank: rank(M) equals to s, which is the number 
of prime users in the network and is usually very small. 

2) Incoherent Property: Generate F randomly (subject 
to hardware limitation). From (Q~|) and the fact that R 
has only s nonzeros on the diagonal, M's SVD factors 
U, S, and V satisfy the incoherence condition 1231 . 
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Fig. 1. System model. 
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M is in general incomplete because of transmission 
failure. Moreover, each CR might only be able to collect 
a random (up to p) number of reports due to the hardware 
limitation. Therefore, the fusion certain receives a subset 
E C [p] x [in] of M's entries. We assume that the received 
entries are uniformly distributed with high probability^. 
Hence, we work with a model in which each entry shows 
up in E identically and independently with probability 
e/y/p x m. Given E pxm , the partial observation of M is 
defined as a p x to matrix given by 



M E = 



M^, if(i,j)GE 
0, otherwise. 



(3) 



We shall first recover the unobserved elements of M from 
M £ . Then, we reconstruct (RG T ) from the given F and M 
using the fact that all but s rows of (RG T ) are zero. These 
nonzero rows correspond to the occupied channels. Since p 
and to are much smaller than n, our approach requires a 
much less amount of sensing and transmission, compared 
to traditional spectrum sensing in which each channel is 
monitored separatively. 

In previous research on matrix completion lF2Tl - lF24l . it 
was proved that under some suitable conditions, a low-rank 
matrix can be recovered from a random, yet small subset of 

'Depending on the different channel gain, the CRs will select different 
coding/modulation/power control schemes so that the received signal to 
noise ratio can be maintained about a certain threshold. Due to this reason, 
we can assume that the loss of information is uniformly distributed. 



its entries by nuclear norm minimization: 

T||M||* + i |M 
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(4) 
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where ||M||* denotes the nuclear norm of matrix M and r is 
a parameter discussed in Section ITlI-CI below. For notational 
simplicity, we introduce the linear operator V that selects 
the components E out of a p x n matrix and form them into 
a vector such that ||7>M - VM E \\l = E^eE l M *o ~ 
MfJ 2 . The adjoint of V is denoted by V*. 

Recent algorithms developed for (0]i include, but not 
limited to, the singular value thresholding (SVT) algorithm 
i2T1l and the fixed-point continuation iterative algorithm 
(FPCA) l22l for fast completion of large-scale matrices (e.g., 
more than 1000x1000), a special trimming step introduced 
by Keshavan et al. in l23l . 

For our problem, we adopt FPCA, which appears to run 
very well for our small-dimensional tests. In the following 
subsections, we describe this algorithm and the steps we take 
for nuclear norm minimization. Also, we study how to use 
the approximate singular value decomposition (SVD)-based 
iterative algorithm introduced in l22l for fast execution. We 
further discuss the stopping criteria for iterations to acquire 
optimal recovery. Finally we show how to obtain R from 
the estimation M of M. 

A. Nuclear Norm Min. via Fixed Point Iterative Algorithm 
FPCA is based on the following fixed-point iteration: 

Y k = M fe - 5 k V*{VM k - VM E ) 



M k+1 = S TSk (Y k ) 



(5) 



where 5^ is step size and S a (-) is the matrix shrinkage 
operator defined as follows: 

Definition 1: Matrix Shrinkage Operator S a ( ■): As- 
sume M G Rpx™ and its SVD is given by M = 
Udiag(cr)V T , where U G W xr , a G W + , and V G K mxr . 
Given a > 0, S a (-) is defined as 

5 T (M) :=Udiag( SQ (a))V T (6) 

with the vector s a (a) defined as: 

s a (x) := m&x{x — a, 0}, component-wise. (7) 

Simply speaking, 5 T (M) reduces every singular values 
(which is nonnegative) of M by r; if one is smaller than 
a, it is reduced to zero. In addition, 5 Q (M) is the solution 
of 



min allXl 



1 



xgi 



M||| 



(8) 



where || • \\p is the Frobenius norm. 

To understand ©, observe that the first step of (|5]l is 
a gradient-descent applied to the second term in (0| and 
thus reduces its value. Because the previous gradient-descent 
generally increases the nuclear norm, the second step of 



© involves solving to reduce the nuclear norm of Y fc . 
Iterations based on © converge when the step sizes 6k are 
properly chosen (e.g., less than 2, or select by line search) 
so that the first step of <(3j is not "expansive" (the other step 
is always non-expansive). 

B. Approximate SVD Based Fixed Point Iterative Algorithm 

As stated in 11221 . the second step of (0 requires com- 
puting the SVD decomposition of Y fc , which is the main 
computational cost of (0. However, if one can predetermine 
the rank of the matrix M, or have the knowledge of the 
approximate range of its rank, a full SVD can be simplified 
to computing only a rank-r approximation to Y fe . Combined 
with the above fixed point iteration, the resulting algorithm is 
called fixed-point continuation algorithm with approximate 
SVD (FPCA). Specifically, the approximate SVD is com- 
puted by a fast Monte Carlo algorithm developed by Drineas 
et al. |3TI . For a given matrix A G M. mxn and parameters 
k s , this algorithm returns an approximations to the largest 
k s singular values corresponding left singular vectors of the 
matrix A in a linear time. 

C. Stopping Criterion for Iterations 

We tune the parameters in FPCA for a better overall per- 
formance. Continuation is adopted by FPCA, which solves a 
sequence of instances of (0J, easy to difficult, corresponding 
to a sequence of large to small values of r. The final r is 
the given one but solving the easier instances of (0} gives 
intermediate solutions that warm start the more difficult 
ones so that the entire solution time is reduced. Solving 
each instance of (0} requires proper stopping. Because our 
ultimate goal is to recover 0/1 values on the diagonal of R, 
accurate solutions of (0]i are not required. Therefore, we use 
the criterion: 

l|M fe+1 -M fe || P . , 



max{l,||M fc || F } 



< mtol 



where mtol is a small positive scalar. Experiments shows 
that le~ 6 is good enough for obtaining optimal R. 

D. Channel Availability Estimation Based on the Complete 
Measurement Matrix 

Since F has more columns than rows, directly solving 
X := RG T in <[TJ from given M is under-determined. 
However, each row Xi of X corresponds to the occupancy 
status of channel i. Ignoring noise in M for now, Xi contains 
a positive entry if and only if channel i is used. Hence, most 
rows of X are completely zero, so every column X. j of 
X is sparse and all X.j's are jointly sparse. Such sparsity 
allows us to reconstruct X from <[TJ and identify the occupied 
channels, which are the nonzero rows of X. 

Since the channel fading decays fast, the entries of X 
have a large dynamic range, which none of the existing 



algorithms can deal with well enough. Hence, we develop 
a novel joint-sparsity algorithm briefly described as follows. 
The algorithm is much faster than matrix completion and 
typically needs 1-5 iterations. At each iteration, every col- 
umn X.j of X is independently reconstructed using the 
model nim{J2iWi\Xij\ : FX.j = M.j}, where M.j is 
the jth column of M. For noisy M, we instead use the 
constraint \\FX.j — M.j\\ < a. The same set of weights wi 
is shared by all j at each iteration. Wi is set to 1 uniformly 
at iteration 1. After channel i is detected in an iteration, Wi 
is set to 0. Through Wi, joint sparsity information is passed 
to all j. Channel detection is performed on the reconstructed 
X.j's at each iteration. It is possible that some reconstructed 
X.j is wrong, so we let larger and sparser Xj's have more 
say. If there is a relatively large Xij in a sparse X.j, then i 
is detected. We have found this algorithm to be very reliable. 
The detection accuracy is determined by the accuracy of M 
provided. 

IV. CSS Joint Sparsity Recovery Algorithm 

In this section, we describe a new, highly effective algo- 
rithm for recovering 

Xfixm = Rrix?i X (G mX n) (10) 

and thus R by thresholding X. The algorithm allows but 
does not require the same F for all CRs, i.e., each CR can use 
a different sensing matrix F. The design of F is discussed 
in Section [V-CI below. 

In X, each column (denoted by X.j) corresponds to 
the channel occupancy status received by CR j, and each 
row Xi t . corresponds to the occupancy status of channel i. 
Ignoring noise for now, a row has a positive value (i.e., 
\Xi r \ > 0) if and only if channel i is used. Since there are 
only a small number of used channels, X is sparse in terms 
of the number of rows containing nonzero. In each nonzero 
row Xi r , there is typically more than one nonzero entry; in 
other words, if Xij 7^ 0, other entries in the same row are 
likely nonzero. Therefore, X is jointly sparse. In the case 
that the true X contains noise, it is approximately, rather 
than exactly, jointly sparse. 

Joint sparsity is utilized in our algorithm to recover X. 
While there are existing algorithms for recovering jointly 
sparse signals in the literature (e.g., in ll25l - ll27l ). our 
algorithm is very different and more effective for our un- 
derlying problem. None of the existing algorithms works 
well to recover X because the entries of X have a very 
large dynamic range because, in any channel fading model, 
channel gains decay rapidly with distance between CRs 
and PRs. Most existing algorithms are based on minimizing 
Tli ll^vllp f° r P ^ 1 an d P = oo. If p = 1, it is the same 
as minimizing the 1-norm of each column independently, so 
joint sparsity is not used for recovery. If p > 1 or p = oo, 
joint sparsity is considered, but it penalizes a large dynamic 
range since the large values in a nonzero row of X contribute 



superlinearly, more than the small values in that row, to the 
minimizing objective. In short, p close 1 loses joint sparsity 
and p bigger than 1 penalizes large dynamic ranges. Our 
new algorithm not only utilizes joint sparsity but also takes 
advantages of the large dynamic range of X. 

The large dynamic range has its pros and cons in CS 
recovery. It makes it easy to recover the locations of large 
entries, which can be achieved even without recovering the 
locations of smaller ones. On the other hand, it makes 
difficult to recover both the locations and values of the 
smaller entries. This difficulty has been studied in our 
previous work l32l . where we proposed a fast and accurate 
algorithm for recovering ID signals x by solving several 
(about 5-10) subproblems in the form of 

Truncated l\ minimization: min{^^ \x{\ : Ax = b} 

(ID 

where the index set T is formed iteratively as {1, ...,n} 
excluding the identified locations of large entries of x. 
With techniques such as early detections and warm starts, 
it achieves both the state-of-the-art speed and least require- 
ment on the number of measurements. We integrate the idea 
of this algorithm with joint sparsity into the new algorithm 
below. The framework of the proposed algorithm is shown 



Algorithm 1 Joint Detection Algorithm 
T<-{l,...,ra} 
repeat 

Independence recovery: 

X <- 

X.j <- :nin{V . , .\\., : AjX.j = b^X.j > 0} for 
every CR j with enough measurements (In presence 
of measurement noise, AjX.j = bj is replaced by 
\\AjX.j-bA\Ka) 
Channel detection: 

select trusted X.j and detect used channels from the 

selections 

Update of T: 

Update T according to detected channels and X 
until the tail of X is small enough 
Report X, and R by thresholding X 



in Table Q] At each iteration, every channel is first subject 
to independent recovery. Unlike minimizing J^i ll^v \\p> 
which ties all CRs together, independent recovery allows 
large entries of X to be quickly recovered. Joint sparsity 
information is passed among the CRs through a shared 
index set T, which is updated iteratively to exclude the used 
channels that are already discovered. Below, we describe 
each step of the above algorithm in more details. 

In the independence recovery step, for every qualified 
CR, a constrained problem in the form of ( fTTT i with con- 
straints AjX.j = bj in the noiseless case, or ||A,Xj — 



bj\\ < a in the noisy case, is considered, where a is an 
estimated noise level. As problem dimensions are small 
in our application, solvers are easily chosen: MATLAB's 
'linprog' for noiseless cases and Mosek 1331 for noisy cases. 
Both of these solvers run in polynomial times. This step 
dominates the total running time of Algorithm [T] but up to 
m optimization problems can be solved in parallel. Paral- 
lelization is simple for the joint-sparsity approach. At each 
outer iteration, all LPs are solved independently, and they 
have small scales relative to today's LP solvers, like Gurobi 
11341 and its MATLAB interface Gurobi Mex J35), where 
Gurobi automatically detects and uses all CPU and cores 
for solving LPs. CRs without enough measurements (e.g., 
most of their reports are missing due to transmission losses 
or errors) are not qualified for independent recovery because 
CS recovery is known unstable in such a case. Specifically, 
we require the number of the available measurements from 
each qualified CR to exceed twice as many as used channels 
or n - \T\. 

When measurements are ample, the first iteration will 
yield exact or nearly exact X.j's. Otherwise, insufficient 
measurements can cause a completely wrong X.j that 
misleads channel detection; neither the locations nor the 
values of the nonzero entries are correct. The algorithm, 
therefore, filters trusted X. .j's that must be either sparse 
or compressible. Large entries in such X. j's likely indicate 
correct locations. A theoretical explanation of this argument 
based on stability analysis for dTTb is given in l36l . 

Used channels are detected among the set of trusted X. j's. 
To further reduce the risk of false detections, we compute a 
percentage for every channel in a way that those channels 
corresponding to larger values in X and whose values are 
located in relatively sparser X.j's are given higher percent- 
ages. Here, relative sparsity is defined proportionally to the 
number of measurements; for fixed number of non-zeros or 
degree of compressibility, the more the measurements, the 
higher the relative sparsity. Hence, X.j corresponding to 
more reported CR j also tends to have a higher percentage. 
In short, larger and sparse solutions have more say. The 
channels receiving higher percentages are detected as used 
channels. 

The index set T is set as {1, ... , n} excluding the used 
channels that are already detected. Obviously, T needs 
to change from one iteration to the next; otherwise, two 
iterations will result in an identical X and thus the stagnation 
of algorithm. Therefore, if the last iteration posts no change 
in the set of used channels yet the stopping criterion (see next 
paragraph) is not met, the channels i corresponding to the 
larger ||Xi .||2 are also excluded from T, and such exclusion 
becomes more aggressive as iteration number increases. This 
is not an ad hoc but a rigorous treatment. It is shown in ll36l 
that larger entries in an inexact CS recovery tend to be the 
true nonzero entries, and furthermore, as long as the new T 
excludes more true than false nonzero entries by a certain 
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Fig. 2. False alarm and missing probability vs. sampling rate. 

fraction, (fTTT i will yield a better solution in terms of a certain 
norm. In short, used channels leave T, and in case of no 
leaves, channels with larger joint values ||Xi .||2 leave T. 

Finally, the iteration is terminated when the tail of X 
is small enough. One way to define the tail size of X is 
the fraction £\ £T \\ X i>- Hp/ zU^t II^vIU ie - the thought- 
unused divided by the thought-used. Suppose that T pre- 
cisely contains the unused channels and measurements are 
noiseless, then every recovered X.j in channel detection is 
exact, so the fraction is zero; with noise, the fraction depends 
on noise magnitude and is small as long as noise is small. 
If T includes any used channel, the numerator will be large 
whether or not X.j's are (nearly) exact. In a sense, the tail 
size measures how well X and T match the measurements 
b and expected sparseness. Unless the true number of used 
channels is known, the tail size appears to be an effective 
stopping indicator. 

V. Discussion 

A. Complexity 

In the worst case, algorithm Q] reduces the cardinality 
of T by 1 per iteration, corresponding to recovering at 
least 1 additional used channel. Therefore, the number 
of iterations cannot exceed the number of total channels. 
However, the first couple of iterations typically recover most 
of the used channels. At each iteration, the independence 
recovery step solves up to m optimization problems, which 
can be independently solved in parallel, so the complexity 
equals a linear program (or second-order cone program) 
whose size is no more than n. The worst case complexity is 
0(n 3 ) but it is almost never observed in sparse optimization 
thanks to solution sparsity. The two other steps are based 
on basic arithmetic and logical operations, and they run in 
0(pxn). In practice, algorithm[T]is implemented and run on 



a workstation at the fusion center. Computational complexity 
will not be a bottleneck of the system. As to the matrix 
completion algorithm, according to ll22l . FPCA can recover 
1000 x 1000 matrices of rank 50 with a relative error of 
10~ 5 in about 3 minutes by sampling only 20 percent of the 
elements. 

B. Comparisons between the Two Approaches 

The matrix completion (Section HlTT > and joint sparsity 
recovery (Section IIVI ) approaches both take linear channel 
measurements as input and both return the estimates of used 
channels. On the other hand, the joint sparsity approach takes 
the full advantage of F, so it is expected to work with smaller 
numbers of measurements. In addition, even though only 
one matrix completion problem needs to be solved in the 
matrix completion approach, it still takes much longer than 
running the entire joint sparsity recovery, and it is not easy to 
parallelize any of the existing matrix completion algorithms. 
However, in the small-scale networks, in cases where too 
much sensing information is lost during transmission or there 
are too many active PRs in the network, which increase 
the signal sparsity level, joint sparsity recovery algorithm 
with our current settings will experience degradation in 
performance. 

We, however, cannot verdict an eventual winner between 
the two approaches as they are both being studied and im- 
proved in the literature. For example, if a much faster matrix 
completion algorithm is developed which takes advantage of 
F, the disadvantages of the approach may no longer exist. 

C. Frequency-Selective Filter Design and Adaptive Sensing 

The proposed method senses the channels, not by mea- 
suring the responses of individual channels one by one, 
but rather measures a few incoherent linear combinations of 
all channels' responses through onboard frequency-selective 
filter set. The filter coefficients which perform as the sensing 
matrix should have entries independently sampled from a 
sub-gaussian distribution, since this is known to be best for 
compressive sensing in terms of the number of measurements 
(given in order of magnitude) required for exact recovery. 
In other words, up to a constant, which is independent 
of problem dimensions, no other type of matrix is yet 
known to perform consistently better. However, other types 
of matrices (such as partial Fourier/DCT matrices 11371 . 11381 
and other random circulant matrices 11391 ) have been either 
theoretically and/or numerically demonstrated to work as 
effectively in many cases. These latter sensing matrices are 
often easier to realize physically or electrically. For example, 
applying a random circulant matrix performs sub-sampled 
convolution with a random vector. 

Frequency-selective surfaces (FSSs) can be used to realize 
frequency filtering. This can be done by designing a planar 



periodic structure with a unit element size around half wave- 
length of the frequency of interests. Both the metallic and 
dielectric materials can be used. To deal with the bandwidth, 
unit elements in different shapes will be tested. 

D. Dynamic CS Update 

Channel occupancy evolves over time as PRs start and 
stop using their channels. Channel gains can also change 
when the PRs move. However, the CS research has so far 
focused on static signal sensing except the very recent path 
following algorithms in l40l . BTI . In the future work, we 
can investigate recovery methods for a dynamic wireless 
environment where based on existing channel occupancy 
information, an insignificant change of channel states can 
be quickly and reliably discovered. Given existing channel 
occupancy X, each new report, which is an entry Mi-j of M, 
is compared with (FX) — . If a significant number of such 
comparisons show differences, then there is a change in the 
true X. Since X = (RG T ), either R or G, or both, have 
changed. A change in R means new channel occupation or 
release. If R is unchanged, then those channel gains in G 
corresponding to occupied channels have changed. It is easy 
to deal with the latter case (i.e., G changed, but R didn't) 
and update the gains of occupied channels because it boils 
down to solving a small linear system. Let F and X denote 
the sub-matrices of F and X, respectively, formed by their 
columns and rows corresponding to the occupied channels. 
Then, the new gains are given in the least-squares solution 
of M = FX, where M shall include new reports arrived 
after the previous recovery/update but may still have missing 
entries. This system is easy to solve since the number of 
occupied channels is small. 

In a similar way it is easy to discover released channels as 
long as there is no introduction of new occupied channels. 
The release of channel i means row Xi of X turns into 
0, or small numbers. Therefore, one can solve the system 
M = FX and find the released channels, which correspond 
to the rows of X with all zero (or small) entries. When 
the system M = FX is inconsistent, it means that the 
received reports cannot be explained by the previously 
occupied channels, so there must be new channel occupation. 
Discovering new channel occupation is more difficult since it 
is to find changes in the previously unoccupied ones, which 
are much more than the occupied channels. However, it 
is computationally much easier than starting from scratch. 
Let X preu and X denote the previous and current channel 
information, respectively. Arguably, X pret , — X is highly 
sparse in the joint sense because only its rows corresponding 
to newly occupied or released channels can have large 
nonzero entries. Hence, X can be quickly recovered by 
performing joint sparsity recovery on X pret , — X over the 
constraints M = FX (or a relax version in the noisy case), 
a task that can be done by the algorithms for stationary 
recovery. 
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Fig. 3. POD vs. sampling rate. 



Fig. 4. Noiseless AWGN channel (no. of CR = 5). 



VI. Simulation Results 

The Probability of Detection (POD) and False Alarm Rate 
(FAR) are the two most important indices associated with 
spectrum sensing. We also consider the Miss Detection Rate 
(MDR) of the proposed system. The higher the POD, the 
less interference will the CRs bring to the PRs, while from 
the CRs' perspective, lower FAR will increase their chance 
of transmission. There is a tradeoff between POD and FAR. 
While designing the algorithms, we try to balance the CR 
nodes' capability of transmission and their interferences to 
the PR nodes. Performance is evaluated in terms of POD, 
FAR and MDR defined as follows: 

FAR=No. False /(No. False+No. Hit) 
MDR=No. Miss/(No. Miss+No. Correct) 
POD=No. Hit/(No. Hit+No. Miss) 

where No. False is the number of false alarms, No. Miss 
is the number of miss detections, No. Hit is the number of 
successful detections of primary users, and No. Correct is 
the number of correct reports of no appearance of PR. We 
define sampling rate as 

No. received measurements at the fusion center 
No. channels x No. CRs 
where (No. channel x No. CR) is the amount of total sensing 
workload in traditional spectrum sensing. 

A. Simulation of Matrix Completion Recovery 

According to FCC and Defense Advance Research 



Projects Agency (DARPA) reports J42J, 031 data, we chose 
to test the proposed matrix completion recovery algorithm 
for spectrum utilization efficiency over a range from 3% 
to 12%, which is large enough in practice. Specifically, the 
number of active primary users is 1 to 4 on a given set of 
35 channels with 20 CR nodes. 



Fig. |2] shows the false alarm and miss detection rates at 
different sampling rates for different numbers of PR nodes. 
Among all cases, the highest miss detection rate is no more 
than 5%, and this is from only 20% samples which are 
supposed to be gathered from the CR nodes regarding all 
the channels. When the sampling rate is increased to 50% 
and even when the channel occupancy is relatively high, 
i.e., 12% of the channels are occupied by the PRs, the miss 
detection rates can be as low as no more than 2%. From 
our simulation results, with a moderate channel occupancy 
at 9%, the false alarm rates are around 3% to 5%. Fig. [3] 
shows the probability of detection at different sampling rates. 
When the spectrum is lightly occupied by the licensed user 
at 3% channels being occupied, only 20% samples offer a 
POD close to 100%, and when there is a slightly raise in 
sampling rate, POD can reach 100%. In the worst case of 
12% spectrum occupancy, 20% sampling rate still can offer 
a POD of higher than 95%, and as the sampling rate reaches 
50%, POD can reach 98%. 

B. Joint Sparsity Recovery Simulation 

Joint sparsity recovery is designed for large scale appli- 
cation, and simulations carried out for a larger dimensional 
applications with the following settings: We consider a 20- 
node cognitive radio network within a 500 x 500 meter square 
area centered at the fusion center. The 20 CR nodes are 
uniformly randomly located. These cognitive radio nodes 
collaboratively sense the existence of primary users within 
a 1000 x 1000 meter square area on 500 channels, which 
are centered also at the fusion center. We chose to test 
the proposed algorithm for the number of active PR nodes 
ranging from 1 to 15 on the given set of 500 channels. Since 
the fading environments of the cognitive radio networks 
vary, we evaluate the algorithm performance under three 




Noiseless Rayleigh Fading Channel-nCR5 
POD vs. Sampling Rate 
— * # ♦ * * # *- 



- no-PR=1 
-no-PR=4 
-no-PR=5 
-no-PR=6 
-no-PR=7 
no-PR=8 
-no-PR=9 
-no-PR=10 



0.5 0.54 0.58 0.62 

Sampling Rate 
MDR vs. Sampling Rate 




0.74 0.78 0.8 



- no-PR=1 

- no-PR=4 

- no-PR=5 

- no-PR=6 

- no-PR=7 
no-PR=8 

- no-PR=9 
-no-PR=10 



0.38 0.42 0.46 



6^4 0*58 0*62 0^6 0^ 0^ 078 0.8 



Noiseless AWGN Channel-nCR10 
POD vs. Sampling Rate 




MDR vs. Sampling Rate 



0.03 

0.0 1 6 




♦ no-PR=1 _ 


0.016 




— 8— no-PR=4 


0.014 




A no-PR=5 
— 1 — no-PR=6 


0.012 
0.01 




— no-PR=7 

— • — no~PR=8 


0.008 




— B— no-PR=9 
if no-PR=10 


0.004 






0.002 






; : 


3 QM 0^8 0^2 0.46 6^ 0^4 0*56 062 0*66 0^ 


74 0^8 



Sampling Rate 



Sampling Rate 



Fig. 5. Noiseless Rayleigh fading channel (no. of CR = 5). 



Fig. 7. Noiseless AWGN channel (no. of CR = 10). 
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Fig. 6. Noiseless log-normal shadowing channel (no. of CR = 5). 



typical channel fading models: AWGN channel, Rayleigh 
fading channel, and lognormal shadowing channel. We 
first evaluate the POD, FAR, and MDR performance of the 
proposed joint sparsity recovery performance in the noiseless 
environment. Fig. |4j Fig. [5] and Fig. [6] show the POD, 
FAR and MDR performance at different sampling rate, for 
AWGN channel, Rayleigh fading channel, and lognormal 
shadowing channel, respectively, when small number of CR 
nodes sense the spectrum collaboratively. Fig. |7J Fig. [HJ 
and Fig. [9] show the POD, FAR and MDR performance at 
different sampling rate, for the aforementioned three types of 
channel models, when there are more CR nodes involved in 
the collaborative sensing of the spectrum. We observe that, 
log-normal shadowing channel model shows the best POD, 
FAR, and MDR performance no matter how many CR nodes 



are involved in the spectrum sensing. While the AGWN 
channel model shows the worst POD, FAR, and MDR 
performance. With respect to POD, the performance gap 
between these two models is at most 10%, which happens 
when the sampling rate is extremely low. For the Rayleigh 
fading channel model, when the number of samples is 62% 
of the total number of channels, for all tested cases we 
achieve 100% POD. If there are less active PR nodes in the 
network, smaller number of samples are required for exact 
detection. In essence, the proposed CCS system is robust to 
severe or poorly modeled fading environments. Cooperation 
among the CR nodes and robust recovery algorithm allow 
us to achieve this robustness without imposing stringent re- 
quirements on individual radios. We then evaluate the POD, 
FAR, and MDR performance of the proposed joint sparsity 
recovery performance in noisy environments. For all the 
simulations considering noise, we adopt the Rayleigh fading 
channel model. Fig. [10] and Fig. II II show the corresponding 
results. We observe that noise does degrade the performance. 
However, as shown in Fig. [TO] when the number of active 
PRs is small enough (e.g., no. of PR =1), even with signal 
to noise ratio as low as 15 dB, we still can achieve 100% 
POD with a sampling rate of merely 50%. Then with an 
increase in the signal to noise ratio, lower sampling rate 
enables more PR nodes to be detected exactly. Fig. QT| shows 
the POD, FAR and MDR performance vs. sampling rate at 
different noise level, each curve for a specific noise level is 
relatively flat (i.e., performance varies a little as sampling 
rate changes). This shows that the noise level has greater 
impact on the spectrum sensing performance rather than the 
sampling rate. At low noise level, e.g., SNR = 45 dB, 40% 
sampling rate enables 100% POD for 4 PR nodes. As SNR 
reduces to 15 dB, no more than 70% POD will be achieved 
even when the number of samples equals to the number of 




Fig. 8. Noiseless Rayleigh fading channel (no. of CR = 10). 



Fig. 9. Noiseless log-normal shadowing channel (no. of CR = 10). 



channels in the network. 

C. Comparison between Matrix Completion Algorithm and 
Joint Sparsity Recovery Algorithm 

For comparison, we applied joint sparsity recovery al- 
gorithm on a small-scale network with the same settings 
as we have used to test the matrix completion recovery. 
Instead of using a 500-channel network, we use a network 
with only 35 channels. Simulation results show that joint 
sparsity recovery algorithm performs better than the matrix 
completion algorithm in the following aspects: 

1) Faster computation due to lower computational com- 
plexity; 

2) Higher POD for the spectrum utilization rate between 
3% and 12% in the noise free simulations; 

To conclude, matrix completion algorithm is good for 
small-scale networks, with relatively high spectrum uti- 
lization, while joint sparsity recovery algorithm has the 
advantage of low computational complexity which enables 
fast computation in large-scale networks. 

VII. Conclusions 

In order to reduce the amount of sensing and transmission 
overhead of cognitive radio (CR) nodes, we have applied 
compressive sensing for collaborative spectrum detection 
in cognitive radio networks. We propose to equip each 
CR node with a frequency-selective filter, which linearly 
combines multiple channel information, and let it send a 
small number of such linear combinations to the fusion 
center, where the channel occupancy information is then 
decoded. Consequently, the amount of channel sensing at 
the CRs and the number of reports sent from the CRs to the 
fusion center reduce significantly. 
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Fig. 10. POD, FAR, and MDR performance vs. sampling rate at different 
SNR. 



Two novel decoding approaches have been proposed - one 
based on matrix completion and the other based on joint 
sparsity recovery. The novel matrix completion approach 
recovers the complete CR-to-center reports from a small 
number of valid reports and then reconstructs the channel 
occupancy information. The joint sparsity approach, on 
the other hand, skips recovering the reports and directly 
reconstructs channel occupancy information by exploiting 
the fact that each occupied channel is observable by multiple 
CR nodes. Our algorithm enables faster recovery for large- 
scale cognitive radio networks. 

The primary user detection performance of the proposed 
approaches has been evaluated by simulations. The results 
of random tests show that, in noiseless cases, the number 
of samples required are no more than 50% of the number 
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Fig. 11. POD, FAR, and MDR performance vs. noise level for different 
number of PR. 



of channels in the network to guarantee exact primary user 
detection for both approaches; while in noisy environments, 
at low channel occupancy rate, we can still have high 
probability of detection. 
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